Qingming Huang

University of Chinese Academy of Sciences; Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences; Peng Cheng Laboratory

Training-Free Composed Video Retrieval via Visual Representation-Guided Video-LLM Reasoning

Jun 01, 2026
Via arXiv

Understanding-Enhanced Model Collaboration for Long-Tailed Egocentric Mistake Detection

Jun 01, 2026
Via arXiv

The Bridge-Garden Dilemma in LLM Distillation: Why Mixing Hard and Soft Labels Works

May 25, 2026
Via arXiv

Localization then Neutralization: Gradient-guided Token Suppression against Visual Prompt Injection Attack

May 24, 2026
Via arXiv

Foresee-to-Ground: From Predictive Temporal Perception to Evidence-Driven Reasoning for Video Temporal Grounding

May 21, 2026
Via arXiv

The Devil is in the Condition Numbers: Why is GLU Better than non-GLU Structure?

May 20, 2026
Via arXiv

CoSyncDiT: Cognitive Synchronous Diffusion Transformer for Movie Dubbing

Apr 14, 2026
Via arXiv

From Static to Dynamic: Exploring Self-supervised Image-to-Video Representation Transfer Learning

Mar 27, 2026
Via arXiv

Locate-then-Sparsify: Attribution Guided Sparse Strategy for Visual Hallucination Mitigation

Mar 17, 2026
Via arXiv

Guiding Diffusion-based Reconstruction with Contrastive Signals for Balanced Visual Representation

Mar 05, 2026
Via arXiv